Sethos: the UPC speech understanding system
نویسندگان
چکیده
In EuroSpeech’95, we presented the first version of Sethos, the speech understanding system which has been developed at the UPC. In this paper some improvements are incorporated at different levels of Sethos: language model, models of the semantic units and acoustic models. These improvements increase the percentage of correctly decoded sentences from 60% to 80%. Some experiments are presented to evaluate the influence of each information source on the final performance. Furthermore, the computational cost is analyzed arriving to an important conclusion: the configuration which gives the best performance is also the less expensive. The reason is that as better is the modeling, narrower is the beam of the search.
منابع مشابه
The UPC text-to-speech system for Spanish and catalan
This paper summarizes the text-to-speech system that has been developed in the Speech Group of the Universitat Politècnica de Catalunya (UPC). The system is composed of a core and different interfaces so that it is compatible for research, for telephone applications (either CTI boards or standard ISDN PC cards supporting CAPI), and Windows applications developed using Microsoft SAPI. The paper ...
متن کاملSpeaker Diarization for Conference Room: The UPC RT07s Evaluation System
In this paper the authors present the UPC speaker diarization system for the NIST Rich Transcription Evaluation (RT07s) [1] conducted on the conference environment. The presented system is based on the ICSI RT06s system, which employs agglomerative clustering with a modified Bayesian Criterion (BIC) measure to decide which pairs of clusters to merge and to determine when to stop merging cluster...
متن کاملOgmios: The UPC Text-to-Speech synthesis system for Spoken Translation
This paper presents the baseline text-to-speech system developed at UPC (Ogmios) plus our recent work on speech prosody generation and the procedures to create high quality language resources for speech synthesis. These contributions have been evaluated within the TC-STAR European project, which is focused on speech-to-speech translation. Several presented contributions have been developed in o...
متن کاملUPC System for the 2016 MediaEval Multimodal Person Discovery in Broadcast TV task
The UPC system works by extracting monomodal signal segments (face tracks, speech segments) that overlap with the person names overlaid in the video signal. These segments are assigned directly with the name of the person and used as a reference to compare against the non-overlapping (unassigned) signal segments. This process is performed independently both on the speech and video signals. A si...
متن کاملThe UPC TTS System Description for the 2007 Blizzard Challenge
This paper presents the evaluation of Ogmios, the UPC TTS system carried out within the Blizzard Challenge Initiative, 2007. Ogmios is a unit-selection based system. Prosodic models are used to select the units using acoustic measures in the target cost but the selected units are not modified. Most of the modules of Ogmios rely on data driven techniques. This evaluation confirms that this frame...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996